Probability fold change: A robust computational approach for identifying differentially expressed gene lists
Abstract
Identifying genes that are differentially expressed under different experimental conditions is a fundamental task in microarray studies. However, different ranking methods generate very different gene lists, which can profoundly affect follow-up analyses and biological interpretation. Developing improved ranking methods is therefore critical in microarray data analysis. We developed a new algorithm, the probabilistic fold change (PFC), which ranks genes based on a confidence interval estimate of fold change. We performed extensive testing using multiple benchmark data sources, including the MicroArray Quality Control (MAQC) data sets. We corroborated the observations from the MAQC data sets using qRT-PCR and Latin square spike-in data sets. Along with PFC, we tested six other popular ranking algorithms: Mean Fold Change (FC), SAM, t-statistic (T), Bayesian-t (BAYT), Intensity-Conditional Fold Change (CFC), and Rank Product (RP). PFC achieved reproducibility and accuracy that were consistently among the best of the seven ranking algorithms, whereas each of the other algorithms showed weaknesses in some cases. Contrary to common belief, our results demonstrated that statistical accuracy does not translate into biological reproducibility, and therefore both quality aspects need to be evaluated.
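The abstract does not give the PFC formula, so the following is only a minimal sketch of the general idea of ranking by a confidence interval on fold change. It is not the authors' algorithm. It assumes log2 intensities, a normal-approximation interval on the difference of group means, and a score equal to the interval bound closest to zero. The function names `ci_fold_change_score`, `rank_genes`, and `top_k_overlap`, and the example data, are hypothetical.

```python
import math
from statistics import NormalDist, mean, variance


def ci_fold_change_score(treated, control, confidence=0.95):
    """Score one gene by the CI bound of its log2 fold change nearest zero.

    Assumption: inputs are log2 intensities and a normal approximation
    is acceptable. Returns 0.0 when the interval contains zero.
    """
    diff = mean(treated) - mean(control)
    se = math.sqrt(variance(treated) / len(treated)
                   + variance(control) / len(control))
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    lower, upper = diff - z * se, diff + z * se
    if lower > 0:
        return lower   # confidently up-regulated
    if upper < 0:
        return upper   # confidently down-regulated
    return 0.0         # interval spans zero: no confident change


def rank_genes(data, confidence=0.95):
    """Order gene names by decreasing absolute score."""
    scores = {gene: ci_fold_change_score(t, c, confidence)
              for gene, (t, c) in data.items()}
    return sorted(scores, key=lambda gene: abs(scores[gene]), reverse=True)


def top_k_overlap(list_a, list_b, k):
    """Fraction of genes shared by the top k of two ranked lists,
    one simple way to quantify gene-list reproducibility."""
    return len(set(list_a[:k]) & set(list_b[:k])) / k


# Hypothetical log2 intensities: (treated replicates, control replicates).
example = {
    "gene_a": ([10.0, 10.1, 9.9], [8.0, 8.1, 7.9]),   # modest change, low noise
    "gene_b": ([12.0, 8.0, 13.0], [8.0, 8.5, 7.5]),   # larger mean change, noisy
    "gene_c": ([8.0, 8.1, 7.9], [8.0, 8.1, 7.9]),     # no change
}
ranking = rank_genes(example)
```

In this toy data, `gene_b` has the larger mean fold change, but its interval spans zero, so `gene_a` ranks first. A plain mean fold change ranking would order them the other way, which illustrates how different ranking methods can produce different gene lists.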
Similar resources
A novel significance score for gene selection and ranking
MOTIVATION When identifying differentially expressed (DE) genes from high-throughput gene expression measurements, we would like to take both statistical significance (such as P-value) and biological relevance (such as fold change) into consideration. In gene set enrichment analysis (GSEA), a score that can combine fold change and P-value together is needed for better gene ranking. RESULTS We...
Robust Modeling of Differential Gene Expression Data Using Normal/Independent Distributions: A Bayesian Approach
In this paper, the problem of identifying differentially expressed genes under different conditions using gene expression microarray data, in the presence of outliers, is discussed. For this purpose, the robust modeling of gene expression data using some powerful distributions known as normal/independent distributions is considered. These distributions include the Student's t and normal distrib...
A global approach to identify differentially expressed genes in cDNA (two-color) microarray experiments
MOTIVATION Currently most of the methods for identifying differentially expressed genes fall into the category of so called single-gene-analysis, performing hypothesis testing on a gene-by-gene basis. In a single-gene-analysis approach, estimating the variability of each gene is required to determine whether a gene is differentially expressed or not. Poor accuracy of variability estimation make...
PIDEX: a Statistical Approach for Screening Differentially Expressed Genes Using Microarray Analysis
Microarray technology is being applied in pharmaceutical drug discovery. A typical experiment is conducted to compare the gene expression profiles under two different conditions and the purpose is to find genes differentially expressed under the conditions. Common practice is to use fold change for detecting differential expression. However, use of fold change can generate many false positive e...
Profound Transcriptomic Differences Found between Sperm Samples from Sperm Donors vs. Patients Undergoing Assisted Reproduction Techniques Tends to Disappear after Swim-up Sperm Preparation Technique
Background Although spermatozoa delivers its RNA to oocytes at fertilization, its biological role is not well characterized. Our purpose was to identify the genes differentially and exclusively expressed in sperm samples both before and after the swim-up process in control donors and infertile males with the purpose to identify their functional significance in male fertility. MaterialsAndMethod...
Journal: Computer Methods and Programs in Biomedicine
Volume 93, Issue 2
Publication year: 2009